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The Lagrange Inversion Theorem 
in the Smooth Case 1 

by Harold R. Parks and Steven G. Krantz 

Abstract: The classical Lagrange inversion theorem is a con- 
crete, explicit form of the implicit function theorem for real ana- 
lytic functions. The authors derive a suitable version of this result 
for C°° functions. Along the way, they Gnd a new asymptotic for 
smooth functions. 

1 The Problem 

The implicit function theorem has a long and colorful history. Finding its 
provenance in considerations of problems of celestial mechanics (as studied 
by Lagrange and Cauchy, among others), the result was at first a rather 
primitive observation about monotone functions on R 1 . Over time, the result 
was extended to N variables, and the monotonicity hypothesis was replaced 
by the now more familiar assumption of nondegeneracy of the Jacobian at a 
point (see [KPb] for a more detailed history). 

It is easy to imagine that there were a number of vestigial forms of the 
implicit function theorem that historically preceded the crisp result that can 
be found in textbooks today. One of these is the so-called Lagrange inversion 
theorem. 2 That is the topic of the present paper. The classical Lagrange 
theorem is about analytic functions. Our purpose here is to extend the 
result to C k or C°° functions. 

The form of the Lagrange inversion theorem that we will consider in this 
paper concerns solving 

y = x + f(y) (1) 
for y as a function of x. Lagrange's result is that, if /(0) = 0, |/'(0)| < 1, 



lr rhe authors are happy to thank the American Institute of Mathematics for its hospi- 
tality during the writing of this paper. 

2 Lagrange's inversion formula can be found in [LAG]. The date of the memoir is 1768, 
but it was published in 1770 in Histoire de L 'Academie Royal des Sciences et Belles- Lettres. 
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and / is real analytic, then 



n=l N 7 

oo / rl \ n 

= * + '<*> + EfrTT)i(!) {/*"<*>>■ (2) 

n=l v / \ / 

One can obtain (2) from [KPb; Theorem 2.3.1] by complexifying /, substi- 
tuting 0(2) = f(z), and taking ^(z) = z, t — 1. A version of the Lagrange 
theorem in the context of real analysis, for smooth functions, is to be found 
in [GRO]. The arguments presented there seem to be incomplete, and the 
statement of the result incorrect. The purpose of the present paper is to 
explore these matters further and to set the record straight. 

We will find it useful to combine the Lagrange inversion formula with the 
formula of Faa di Bruno. 3 The formula of Faa di Bruno tells us that if / and 
J are open intervals in R and / : / — > J and h : J — > R are C°° functions, 
then the nth derivative of h o f is given by 

(/i o /)<»>(*) = 



k 1 \k 2 \...k n \ ^w/\^ v j y 2 , j ■■■y n y 

where k = k\ + k 2 + ■ ■ ■ + k n and the sum is taken over all k\, k 2 , ■ ■ ■ , k n for 
which ki + 2k 2 + • • • + nk n = n. 

In particular, when h(t) = t n+1 , we have 



h {k \t) 



(n + l)\/(n+l-k)\]t n+1 ~ k if A; < n + 1, 
if n + 1 < k. 



Faa di Bruno's formula then becomes 

yv +i (*)}(*)= 

v n\ (n + 1)! ( fQ\t) \ k ° ( fW(t) \ kl ff^W ^ 
^k \k l \...k n \\ 0! J V I' J "V n\ 



(4) 



3 Faa di Bruno's formula first appeared in [FDB] — for a proof, see Section 1.3 of [KPa]. 
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where k = (n + 1) — (ki + k 2 + • • • + k n ) and the sum is taken over all 
ki, k 2 , ■ ■ ■ , k n for which k\ + 2k 2 + • • • + nk n = n. Accordingly, Lagrange's 
inversion formula can be written 



s = I+ / (l ) + E»!Enphr . (5) 

n =l i=o l ' \ ' 



where the inner sum is taken over all k±,k 2 , ■ ■ ■ ,k n for which k\ + 2/c 2 + • • • + 
n/c„ = n and where /co is defined by k = {n + 1) — {k\ + k 2 + • • • + /c n )- 
We can rewrite (5) as 

m 

y = x + f(x) + C k0 _ km .[f{x)p.\{[f%x)] k \ (6) 

(fe ,...,fe m )es «=i 

where 

oo n 

S = (J (J I (fco, fci, • • • ,k m ) ■ < ki, for i = 1, . . . ,m - 1, 



n=l m=l 



1 < A; m , 

fc = (n + 1) - (£4 + k 2 H h k m ), 

ki + 2A; 2 H h m/c m = n j , 



and where 

(h + 2k 2 + ■ ■ ■ + mk m )\ 



n _ ^i-i-^/i2-i r »^mj' l r , 

°fc ,...,fc m - r-j 11 fc 



j=l 



for (/c , • • • , G S. Note that all the Cfc ,...,fc m are positive. 

2 A Counterexample in the C°° Case 

We will now show that the Lagrange inversion formula is not true in the C°° 
category. 

To begin with, we let / be a real analytic function defined on an open 
interval containing [0, 1] and such that 

(a) /(0) = 0, 
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(b) < /'(O) < 1, 

(c) for each non-negative integer m, < f^ m \t) holds for all < t. 
Note that, by (b), we have 

< fit) < t for t e (0, 1) . (7) 

Many such functions exist. For example, we could take f(x) = \ e x — A, 
where < A < 1. 

Since / is real analytic, the Lagrange inversion formula is valid in an open 
interval / containing 0. 

Next, we will define sequences {ae} and {be} in / that satisfy 

b > a > bi > ai > • ■ • be > a e > ■ ■ ■ 
b e = a e + f{be) . 

In fact, we may choose any b in / with < b < 1. We set a = b — f(b ), 
noting that (7) implies that < a < b . Proceeding inductively, we observe 
that ii bo > a > bi > ax > ■ ■ ■ be > ae have already been chosen, then we 
may choose any be+i with < b i+ i < ae and set 

ae+i = b e+1 - f(b e+ i) . 

Again we use (7) to conclude that < ae + i < be+\- 
The Lagrange inversion formula (6) tells us that 

m 

be = a e + f(a e ) + £ C kh ..., km ■ [ f(a e ) } k ° ■ ]J [ /« (a e ) } K (8) 
(fc ,...,fcm)es i=i 

holds, for t = 0, 1, ... . 

Now we consider a C°° function g : R — > R such that 

(d) (?(M=0for£ = 0,l,..., 

(e) g(a t ) > for £ = 0, 1, ... , 

(f) g {m \a e ) > for £ = 0, 1, . . . and m = 1, 2, . . . . 
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[We consider in Lemma 2.1 and Remark 2.2 why such a function g exists.] 
Then we have 

be = a e + (f + g)(b e ). (9) 

If the Lagrange inversion formula held for the function / + g in any neigh- 
borhood of 0, then it would hold at infinitely many of the pairs (ae, be) in 
(9). For such a pair (ag, be), the Lagrange inversion formula for the function 
f + g would tell us that 

m 

be = ae + (f+g)(ae)+ £ C ko _ km - [ (/ + g)(a e ) } k ° J[ [ (/ + gf\a t ) ] ki 

(fc ,...,fcm)€3 1=1 

(10) 

holds. But, in fact, (10) fails to hold for any choice of I. This is so because, 
while the lefthand sides of (8) and (10) are equal, the righthand side of (10) 
consists of the righthand side of (8) plus nonnegative terms, among which is 
the positive term g(ag). 

Finally, it remains to show that a function g satisfying the above condi- 
tions exists, but this fact follows from the next lemma. 

Lemma 2.1 Let {i n }" =1 be a strictly decreasing sequence of positive num- 
bers with lim^oo x n = 0. //, for each n = 1, 2, . . . , there is given a sequence 
{°"n,fc}£Lo — { — I; 0> 1 }> then there exists a C°° function g : H. — > R such 
that 



sgn 



^ (fc) (^n 



= CF n ,k, for n = 1,2,... and k = 0,1,..., (11) 
5 «(0) = 0, /or n = 1,2,.... (12) 



Proof. Let h,l2, ... be pairwise disjoint open intervals with x n e /„. As- 
sume also that I\ is bounded. The other intervals are automatically bounded, 
because /„ C (x n+ i, holds for n = 2, 3, . . . . 

An old theorem of Emile Borel (see page 44 of [BOR] or else [KPa]) tells 
us that, for each n, there exists a C°° function <p n with 

<f> { *\x n ) = a nik , for A; = 0,1,.... (13) 

We may assume that <p n has compact support contained in I n . 
We will define 

oo 

g(x) = J2^Mx), (14) 
k=i 
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where the positive numbers £ n will be chosen large enough to insure that g 
is C°° . Specifically, it is easy to see that the choices 

in = e 1/x " +1 ■ sup { \</>%\x)\ :xel m , 1 < m < n, < k < n) 

will suffice to guarantee that g is C°° and that (12) holds. 

It is clear that (13) and (14) insure that (11) holds. ■ 

Remark 2.2 Lemma 2.1 is applied to construct a function g satisfying (d), 
(e), and (f) by setting 

X2i+i = bi, o"2£+i,o = 0, x 2 £ +2 = a e , o"2£+2,o = 1 , for £ = 0,1,2,... 
and setting 

o~n,k = for n — 1, 2, . . . and k — 1, 2, . . . . 



3 A Positive Result in the C N Case 

It is reasonable to conjecture — and we will show it to be true — that, if / is 
C N , then the function y satisfying y = x + f(y) is well approximated by the 
truncated Lagrange formula of order N. By the later we mean that part of 



n=l 



dx 



that only contains derivatives of / of order iV or smaller (the truncated 
Lagrange formula is given below in (27)). 
We need a lemma. 

Lemma 3.1 Suppose f(x) is N times continuously differentiable, N > 1, in 
a neighborhood of x = with /(0) = and |/'(0)| < I. Let P(x) be the 
Taylor polynomial of degree N for f at 0. If y — y(x) satisfies 

y = x + f(y) 

in a neighborhood of 0, then 



n=l 



= o(x N ) 



as x 



0. 
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Remark 3.2 Because P(x) is the Taylor polynomial of degree N for / at 0, 
only derivatives of / of order N or less are involved in the construction of P. 
Thus we see that the expression 



z + th ft)""™ 

n=l v 7 



appearing in the lemma involves only derivatives of / of order N or less, but 
note that those derivatives are evaluated at rather than at x as are the 
derivatives in Lagrange's formula. 

Proof of the Lemma. By Taylor's theorem, we may write 

f(x) = P(x) + R(x), (15) 

where 

R(x) = o(x N ) asx^O. (16) 

Since / is at least C 1 near 0, we can apply the implicit function theorem 
to see that there exists a function y — y(x) satisfying 

y = x + f(y). (17) 

Implicit differentiation shows us that y'(0) — 1/[1 — /'(0)]. Thus we have 

y = 0(x) as x -»• 0. (18) 

Combining (15) and (17), we have 

y = x + P(y) + R(y). (19) 

Since P(x) is an analytic function, we may apply the Lagrange inversion 
theorem to see that the function z = z(x) given by 



jxi f j x n-1 

n=l 



satisfies 

z = x + P(^). (21) 

As before, we have 

z = 0(x) as x -»• 0. (22) 
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Note that (20) gives us 

/ oo 

»- i^i {P n (x)} \ = y - * ■ (23) 



n=l N 7 / 



Now consider an x near for which 7^ 2(2;). Applying the mean 

value theorem to 

Q(t) = t - P(t) 

on the interval [min{y, z}, max{?/, z}], we obtain £ in that interval with 
(y - z) - [P(y) - P{z)] = Q{y)-Q{z) 

= Q'H){y-z) 

= [i-J*(01 (*-*)• (24) 

By (19), (21), and (24), we have 

R(y) = [y-x-P{y)]-[z-x-P{z)\ 
= (y - z) - [P(y) - P(z)\ 

= [1 - p'(0] (y - z) . (25) 

By (16) and (18), we have R(y) = o(x N ), so we see from (25) that 

[1 - P'(0] {y~z)= o(x N ) . (26) 

Since P'(0) = f'(0) < 1 and since ^ -> as x -> [by (18) and (22)], we 
conclude from (26) that y — z = o(x N ). The result follows from (23). ■ 

Theorem 3.3 Suppose f(x) is N times continuously differentiable, N > 1, 
in a neighborhood of x = with f(0) = and |/'(0)| < 1. If y = y(x) 
satisfies 

y = x + f(y) 

in a neighborhood of 0, then 



y - 



x + f(x) + Yl ■ [ f{x) ] k °-f[[ f (l) (x) } * 

(fc0,...,fc m )e3jv i=1 



o(x N ) 
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as x — > 0. Here 

00 min{n,JV} 

"AT 



00 uiiuv")"/ 

U U {(k ,ki,...,k m ) : < hi, fori = l,...,m- 1, 



ra=l m=l 

1 < k 



mi 



k = (n + 1) - (fci + fc 2 H h fc m ), 

fci + 2/c 2 + h m/c m = n I , 

and 



(A;i + 2fc 2 H Vmk m )\ , yj 

^fco,..,* m - 1 11 



1 



vi=l 



Remark 3.4 By (6), we see that the expression 

m 

x + f(x)+ C *o,.,k m -[f(x)} ko -l[[f {i) (x)} ki , (27) 

(fco,...,fe m )eSjv i=l 

appearing in the theorem involves only derivatives of / of order N or smaller, 
and those derivatives are evaluated at x as are the derivatives in Lagrange's 
formula. Thus (27) is the truncated Lagrange formula described at the be- 
ginning of this section. 

Proof of the Theorem. As in Lemma 3.1, let P(x) be the Taylor poly- 
nomial of degree N for / at 0. The formula of Faa di Bruno tells us that 

00 1 / j \ n—l 

X 



1 /J \ «— 1 

n=l x ' 



= x + P(x) + ^ CV.., fem • [ P(x) } k ° • n [ P« (x) ] * . 

(fc ,...,fc m )es i=i 

Since P is a polynomial of degree N, we have pW(x) = whenever i > N. 
Thus we have 

m 

x + p(x) + ^ c fc0 ,..., fcm • [ p(x) r • n [ p {i) w } ki 

(ko,...,k m )eS i=l 

m 

= x+p(x)+ ^ ^...^-[p^^-nc^^)]^- 

(fc ,...,fc m )e3jv i=1 
9 



Applying Lemma 3.1, we see that 



o(x N ) = y - 



x 



00 1 (d xn - x 



n=l 



n\ V dx 



y- 



y - 



X 



X 



(fco,.--,fcm)es JV 



i=l 



(fcoj.-.jfcmjGSjv 



+ [/(x) - P(x)] 

+ ^ ] Cfco,...,fcm 

(feo,...,A; m )eSjv 

m m 

[/w] fc °-n[/ w w] fei -[^)] feo -ii[ pw w] fci 

i=l i=l 

Of course, we have f(x) — P(x) = o(x N ). So it will suffice to show that 

m m 

imr ■ n - [ p ^)f° ■ n i p( ^T = °(* n ) w 
i=i 1=1 

holds, for each (/c , • • • , k m ) G Sjv- 
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We rewrite the lefthand side of (28) as 



m— 1 



i=i i=i 

+•••+ 

i/wr-m^f • n [ pW (-)f 

i=l i=3+l 

j—1 m 

i=l i=7 

+ ••• + 



+ [/(x)]*°n [p^x)]** - [p(x)] ko n [pw(x)]' 



i=l i=l 

Thus it will suffice to show that 

i & i r n( a\ / \ i k 



holds, for each j = 1, 2, . . . , m for which fcj > 0, and that 

[/(x)] fc °-[P(x)] fe ° = (^). 
If j G {1,2,..., m} and fcj > holds, then we have 

[P^(x)] kj -[f^(x)] kj 



kj-l 

, ki-i-e 



= [P<i\x) - /&■)(*)] . £ [PW(,)]' [/^(^)] J = o(^) 

£=0 

Using f(x) = 0(x 1 ) and P^(x) - f®(x) = o{x N - j ), we obtain 
[f(x)] k ° ([P«\x)] k *-[f^(x)] kj )=o(x N -3 +k °). 
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By the definition of k and using n — k\ + 2k 2 + ••• + % + ••• + mk m , we 
have 



N-j + k 

= N - j + (n + 1) - (h + k 2 H h % H h k m ) 

= N - j + 1 + k 2 + 2k 3 + ■ ■ ■ (j - \)kj + • • • + (m - l)k m 

= N+(j-l)(k j -l)+ (t-l)ki>N. 

l = 2,....m 
^3 

Finally, we observe that 

im] ko -[p(x)} k ° = (/(x)-p(x))^[/(x)r[p(x)]^- 1 

i=i 

= o(x N ). a 
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